feat: add NaN/Inf detection in learning pipeline by jdbloom · Pull Request #21 · NESTLab/GSP-RL

jdbloom · 2026-04-09T23:36:59Z

Summary

Adds _check_nan() utility function to learning_aids.py
Guards all 5 learning functions (DQN, DDQN, DDPG, TD3, RDDPG) with NaN/Inf checks after loss.backward()
Raises RuntimeError with step context if NaN detected, enabling crash dumps in the diagnostics system

Test plan

5 new unit tests for _check_nan (float NaN, float Inf, normal float, tensor NaN, normal tensor)
223 existing GSP-RL tests pass with no regressions

🤖 Generated with Claude Code

…unctions Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

learn_attention was the only learn function missing the NaN/Inf detection guard added in the previous commit. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Convergence tests were nondeterministic — no torch/numpy/env seeds were set, so CI results depended on random initialization. Add deterministic seeding (SEED=42) for torch, numpy, and gymnasium env resets. Lower Pendulum improvement threshold from 50% to 20% — 100 episodes is tight for continuous control and 20% improvement over random already demonstrates learning. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

jdbloom and others added 3 commits April 9, 2026 14:28

feat(learn): add NaN/Inf detection after loss.backward in all learn f…

5f88894

…unctions Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

fix(learn): add _check_nan guard to learn_attention

a4afa57

learn_attention was the only learn function missing the NaN/Inf detection guard added in the previous commit. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

jdbloom merged commit f4eeb52 into main Apr 10, 2026
4 checks passed

jdbloom mentioned this pull request Apr 13, 2026

feat(actor): expose per-step GSP prediction loss via last_gsp_loss #23

Merged

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add NaN/Inf detection in learning pipeline#21

feat: add NaN/Inf detection in learning pipeline#21
jdbloom merged 3 commits intomainfrom
feat/nan-detection

jdbloom commented Apr 9, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

jdbloom commented Apr 9, 2026

Summary

Test plan

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant